Search for: All records

Creators/Authors contains: "Hao, Yue"

  1. Free, publicly-accessible full text available April 1, 2026
  2. The ciliate genus Paramecium served as one of the first model systems in microbial eukaryotic genetics, contributing much to the early understanding of phenomena as diverse as genome rearrangement, cryptic speciation, cytoplasmic inheritance, and endosymbiosis, as well as more recently to the evolution of mating types, introns, and roles of small RNAs in DNA processing. Substantial progress has recently been made in the area of comparative and population genomics. Paramecium species combine some of the lowest known mutation rates with some of the largest known effective populations, along with likely very high recombination rates, thereby harboring a population-genetic environment that promotes an exceptionally efficient capacity for selection. As a consequence, the genomes are extraordinarily streamlined, with very small intergenic regions combined with small numbers of tiny introns. The subject of the bulk of Paramecium research, the ancient Paramecium aurelia species complex, is descended from two whole-genome duplication events that retain high degrees of synteny, thereby providing an exceptional platform for studying the fates of duplicate genes. Despite having a common ancestor dating to several hundred million years ago, the known descendant species are morphologically indistinguishable, raising significant questions about the common view that gene duplications lead to the origins of evolutionary novelties. 
  3. Rogers, Rebekah (Ed.)
    Abstract Whole-genome duplications (WGDs) have shaped the gene repertoire of many eukaryotic lineages. The redundancy created by WGDs typically results in a phase of massive gene loss. However, some WGD–derived paralogs are maintained over long evolutionary periods, and the relative contributions of different selective pressures to their maintenance are still debated. Previous studies have revealed a history of three successive WGDs in the lineage of the ciliate Paramecium tetraurelia and two of its sister species from the Paramecium aurelia complex. Here, we report the genome sequence and analysis of 10 additional P. aurelia species and 1 additional outgroup, revealing aspects of post-WGD evolution in 13 species sharing a common ancestral WGD. Contrary to the morphological radiation of vertebrates that putatively followed two WGD events, members of the cryptic P. aurelia complex have remained morphologically indistinguishable after hundreds of millions of years. Biases in gene retention compatible with dosage constraints appear to play a major role opposing post-WGD gene loss across all 13 species. In addition, post-WGD gene loss has been slower in Paramecium than in other species having experienced genome duplication, suggesting that the selective pressures against post-WGD gene loss are especially strong in Paramecium. A near complete lack of recent single-gene duplications in Paramecium provides additional evidence for strong selective pressures against gene dosage changes. This exceptional data set of 13 species sharing an ancestral WGD and 2 closely related outgroup species will be a useful resource for future studies on Paramecium as a major model organism in evolutionary cell biology.
  4. Genetic variants of mitochondrial DNA at the individual (heteroplasmy) and population (polymorphism) levels provide insight into their roles in multiple cellular and evolutionary processes. However, owing to the paucity of genome-wide data at the within-individual and population levels, the broad patterns of these two forms of variation remain poorly understood. Here, we analyze 1,804 complete mitochondrial genome sequences from Daphnia pulex, Daphnia pulicaria, and Daphnia obtusa. Extensive heteroplasmy is observed in D. obtusa, where the high level of intraclonal divergence must have resulted from a biparental-inheritance event, and recombination in the mitochondrial genome is apparent, although perhaps not widespread. Global samples of D. pulex reveal remarkably low mitochondrial effective population sizes, <3% of those for the nuclear genome. In addition, levels of population diversity in mitochondrial and nuclear genomes are uncorrelated across populations, suggesting an idiosyncratic evolutionary history of mitochondria in D. pulex. These population-genetic features appear to be a consequence of background selection associated with highly deleterious mutations arising in the strongly linked mitochondrial genome, which is consistent with polymorphism and divergence data suggesting a predominance of strong purifying selection. Nonetheless, the fixation of mildly deleterious mutations in the mitochondrial genome also appears to be driving positive selection on genes encoded in the nuclear genome whose products are deployed in the mitochondrion. 
  5. Abstract By modeling the homoeologous gene losses that occurred in 50 genomes deriving from ten distinct polyploidy events, we show that the evolutionary forces acting on polyploids are remarkably similar, regardless of whether they occur in flowering plants, ciliates, fishes, or yeasts. We show that many of the events show a relative rate of duplicate gene loss before the first postpolyploidy speciation that is significantly higher than in later phases of their evolution. The relatively weak selective constraint experienced by the single-copy genes these losses produced leads us to suggest that most of the purely selectively neutral duplicate gene losses occur in the immediate postpolyploid period. Nearly all of the events show strong evidence of biases in the duplicate losses, consistent with them being allopolyploidies, with two distinct progenitors contributing to the modern species. We also find ongoing and extensive reciprocal gene losses (alternative losses of duplicated ancestral genes) between these genomes. With the exception of a handful of closely related taxa, all of these polyploid organisms are separated from each other by tens to thousands of reciprocal gene losses. As a result, it is very unlikely that viable diploid hybrid species could form between these taxa, since matings between such hybrids would tend to produce offspring lacking essential genes. It is, therefore, possible that the relatively high frequency of recurrent polyploidies in some lineages may be due to the ability of new polyploidies to bypass reciprocal gene loss barriers.
  6. Abstract Model species continue to underpin groundbreaking plant science research. At the same time, the phylogenetic resolution of the land plant Tree of Life continues to improve. The intersection of these two research paths creates a unique opportunity to further extend the usefulness of model species across larger taxonomic groups. Here we promote the utility of the Arabidopsis thaliana model species, especially the ability to connect its genetic and functional resources, to species across the entire Brassicales order. We focus on the utility of using genomics and phylogenomics to bridge the evolution and diversification of several traits across the Brassicales to the resources in Arabidopsis, thereby extending scope from a model species by establishing a “model clade”. These Brassicales-wide traits are discussed in the context of both the model species Arabidopsis thaliana and the family Brassicaceae. We promote the utility of such a “model clade” and make suggestions for building global networks to support future studies in the model order Brassicales. 
  7. Abstract Understanding why various organisms evolve alternative ways of living requires information on both the fitness advantages of phenotypic modifications and the costs of constructing and operating cellular features. Although the former has been the subject of a myriad of ecological studies, almost no attention has been given to how organisms allocate resources to alternative structures and functions. We address these matters by capitalizing on an array of observations on diverse ciliate species and from the emerging field of evolutionary bioenergetics. A relatively robust and general estimator for the total cost of a cell per cell cycle (in units of ATP equivalents) is provided, and this is then used to understand how the magnitudes of various investments scale with cell size. Among other things, we examine the costs associated with the large macronuclear genomes of ciliates, as well as ribosomes, various internal membranes, osmoregulation, cilia, and swimming activities. Although a number of uncertainties remain, the general approach taken may serve as a blueprint for expanding this line of work to additional traits and phylogenetic lineages.